U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

SRX10386737: GSM5187979: M31-Entorhinal Cortex (RRBS); Mus musculus; Bisulfite-Seq
2 ILLUMINA (Illumina HiSeq 2500) runs: 41.9M spots, 2.1G bases, 719.4Mb downloads

Submitted by: NCBI (GEO)
Study: Characterizing the properties of bisulfite sequencing data: maximizing power and sensitivity to identify differences in DNA methylation [RRBS]
show Abstracthide Abstract
Background The combination of sodium bisulfite treatment with highly-parallel sequencing is a common method for quantifying DNA methylation across the genome. The power to detect between-group differences in DNA methylation using bisulfite-sequencing approaches is influenced by both experimental (e.g. read depth, missing data and sample size) and biological (e.g. mean level of DNA methylation and difference between groups) parameters. There is, however, no consensus about the optimal thresholds for filtering bisulfite sequencing data with implications for the reproducibility of findings in epigenetic epidemiology. Results We used a large reduced representation bisulfite sequencing (RRBS) dataset to assess the distribution of read depth across DNA methylation sites and the extent of missing data. To investigate how various study variables influence power to identify DNA methylation differences between groups, we developed a framework for simulating bisulfite sequencing data. As expected, sequencing read depth, group size, and the magnitude of DNA methylation difference between groups all impacted upon statistical power. The influence on power was not dependent on one specific parameter, but reflected the combination of study-specific variables. As a resource to the community, we have developed a tool, POWEREDBiSeq, which utilizes our simulation framework to predict study-specific power for the identification of DNAm differences between groups, taking into account user-defined read depth filtering parameters and the minimum sample size per group. Conclusions Our data-driven approach highlights the importance of filtering bisulfite-sequencing data by minimum read depth and illustrates how the choice of threshold is influenced by the specific study design and the expected differences between groups being compared. The POWEREDBiSeq tool can help users identify the level of data filtering needed to optimize power and aims to improve the reproducibility of bisulfite sequencing studies. Overall design: DNA was collected from mouse entorhinal cortex samples and methylation levels quantified using both array and RRBS technology. For the RRBS data, library pools were distributed across thirty-two HiSeq2500 (Illumina) lanes and subjected to 50 bp single-end sequencing. Randomly assigned groups 1 and 2 are used for power calculations and are not meaningful biological groups in the paper that this data is associated with.
Sample: M31-Entorhinal Cortex (RRBS)
SAMN18379196 • SRS8509835 • All experiments • All runs
Organism: Mus musculus
Library:
Instrument: Illumina HiSeq 2500
Strategy: Bisulfite-Seq
Source: GENOMIC
Selection: Reduced Representation
Layout: SINGLE
Construction protocol: Genomic DNA from all samples was isolated from the entorhinal cortex using the AllPrep DNA/RNA Mini Kit (Qiagen) RRBS libraries were prepared using the Premium RRBS kit (Diagenode) with some modifications. Libraries were checked using the High Sensitivity D1000 Screentape and 2200 TapeStation System (Agilent Technologies). Final library pools were distributed across thirty-two HiSeq2500 (Illumina) lanes and subjected to 50 bp single-end sequencing.
Experiment attributes:
GEO Accession: GSM5187979
Links:
Runs: 2 runs, 41.9M spots, 2.1G bases, 719.4Mb
Run# of Spots# of BasesSizePublished
SRR1400979421,641,0121.1G375.2Mb2021-03-21
SRR1400979520,228,6051G344.2Mb2021-03-21

ID:
13704888

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...